Overview

Dataset statistics

Number of variables43
Number of observations78441
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory25.7 MiB
Average record size in memory344.0 B

Variable types

CAT33
NUM9
BOOL1

Warnings

acetohexamide has constant value "78441" Constant
troglitazone has constant value "78441" Constant
examide has constant value "78441" Constant
citoglipton has constant value "78441" Constant
metformin-rosiglitazone has constant value "78441" Constant
metformin-pioglitazone has constant value "78441" Constant
df_index has unique values Unique
num_procedures has 40926 (52.2%) zeros Zeros
number_outpatient has 65145 (83.0%) zeros Zeros
number_emergency has 71738 (91.5%) zeros Zeros
number_inpatient has 50552 (64.4%) zeros Zeros

Reproduction

Analysis started2020-11-27 19:27:40.197066
Analysis finished2020-11-27 19:28:03.479046
Duration23.28 seconds
Software versionpandas-profiling v2.9.0
Download configurationconfig.yaml

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct78441
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39220
Minimum0
Maximum78440
Zeros1
Zeros (%)< 0.1%
Memory size612.8 KiB
2020-11-27T20:28:03.557548image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3922
Q119610
median39220
Q358830
95-th percentile74518
Maximum78440
Range78440
Interquartile range (IQR)39220

Descriptive statistics

Standard deviation22644.11057
Coefficient of variation (CV)0.5773613098
Kurtosis-1.2
Mean39220
Median Absolute Deviation (MAD)19610
Skewness0
Sum3076456020
Variance512755743.5
MonotocityNot monotonic
2020-11-27T20:28:03.732514image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
20471< 0.1%
 
313861< 0.1%
 
129471< 0.1%
 
27081< 0.1%
 
6611< 0.1%
 
68061< 0.1%
 
47591< 0.1%
 
272881< 0.1%
 
252411< 0.1%
 
293391< 0.1%
 
Other values (78431)78431> 99.9%
 
ValueCountFrequency (%) 
01< 0.1%
 
11< 0.1%
 
21< 0.1%
 
31< 0.1%
 
41< 0.1%
 
ValueCountFrequency (%) 
784401< 0.1%
 
784391< 0.1%
 
784381< 0.1%
 
784371< 0.1%
 
784361< 0.1%
 

race
Categorical

Distinct5
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
Caucasian
64217 
AfricanAmerican
11386 
Hispanic
 
1059
Other
 
1025
Asian
 
754
ValueCountFrequency (%) 
Caucasian6421781.9%
 
AfricanAmerican1138614.5%
 
Hispanic10591.4%
 
Other10251.3%
 
Asian7541.0%
 
2020-11-27T20:28:03.914109image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:04.036395image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:04.124198image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length15
Median length9
Mean length9.766703637
Min length5

gender
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
Female
46296 
Male
32145 
ValueCountFrequency (%) 
Female4629659.0%
 
Male3214541.0%
 
2020-11-27T20:28:04.266135image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:04.382793image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:04.448960image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length6
Mean length5.180403106
Min length4

age
Categorical

Distinct10
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
[60-70)
24479 
[70-80)
15306 
[80-90)
14888 
[50-60)
12740 
[40-50)
5365 
Other values (5)
5663 
ValueCountFrequency (%) 
[60-70)2447931.2%
 
[70-80)1530619.5%
 
[80-90)1488819.0%
 
[50-60)1274016.2%
 
[40-50)53656.8%
 
[30-40)27903.6%
 
[90-100)13631.7%
 
[20-30)9691.2%
 
[10-20)4130.5%
 
[0-10)1280.2%
 
2020-11-27T20:28:04.603245image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:04.693208image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:04.822536image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length8
Median length7
Mean length7.015744317
Min length6

time_in_hospital
Real number (ℝ≥0)

Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.036868474
Minimum1
Maximum14
Zeros0
Zeros (%)0.0%
Memory size612.8 KiB
2020-11-27T20:28:04.918265image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile11
Maximum14
Range13
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.078211698
Coefficient of variation (CV)0.7625246444
Kurtosis1.030218595
Mean4.036868474
Median Absolute Deviation (MAD)2
Skewness1.238462783
Sum316656
Variance9.475387256
MonotocityNot monotonic
2020-11-27T20:28:05.007732image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%) 
11727422.0%
 
21561319.9%
 
3971312.4%
 
4864511.0%
 
5781610.0%
 
646155.9%
 
738294.9%
 
830533.9%
 
923963.1%
 
1112341.6%
 
Other values (4)42535.4%
 
ValueCountFrequency (%) 
11727422.0%
 
21561319.9%
 
3971312.4%
 
4864511.0%
 
5781610.0%
 
ValueCountFrequency (%) 
148371.1%
 
1310541.3%
 
1211311.4%
 
1112341.6%
 
1012311.6%
 

num_lab_procedures
Real number (ℝ≥0)

Distinct109
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.54275188
Minimum1
Maximum126
Zeros0
Zeros (%)0.0%
Memory size612.8 KiB
2020-11-27T20:28:05.112011image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile4
Q132
median45
Q357
95-th percentile75
Maximum126
Range125
Interquartile range (IQR)25

Descriptive statistics

Standard deviation20.35778799
Coefficient of variation (CV)0.4675356314
Kurtosis-0.2020443723
Mean43.54275188
Median Absolute Deviation (MAD)12
Skewness-0.1759516571
Sum3415537
Variance414.4395319
MonotocityNot monotonic
2020-11-27T20:28:05.231341image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
4528813.7%
 
126293.4%
 
4325303.2%
 
4222002.8%
 
4721052.7%
 
3919632.5%
 
4619002.4%
 
5518532.4%
 
4018292.3%
 
4117642.2%
 
Other values (99)5678772.4%
 
ValueCountFrequency (%) 
126293.4%
 
27891.0%
 
34080.5%
 
45050.6%
 
51710.2%
 
ValueCountFrequency (%) 
1268< 0.1%
 
1141< 0.1%
 
11317< 0.1%
 
11111< 0.1%
 
1078< 0.1%
 

num_procedures
Real number (ℝ≥0)

ZEROS

Distinct7
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.188217896
Minimum0
Maximum6
Zeros40926
Zeros (%)52.2%
Memory size612.8 KiB
2020-11-27T20:28:05.339715image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q32
95-th percentile5
Maximum6
Range6
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.657655548
Coefficient of variation (CV)1.395077075
Kurtosis1.29032285
Mean1.188217896
Median Absolute Deviation (MAD)0
Skewness1.462049866
Sum93205
Variance2.747821915
MonotocityNot monotonic
2020-11-27T20:28:05.416261image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=7)
ValueCountFrequency (%) 
04092652.2%
 
11373117.5%
 
2894011.4%
 
363738.1%
 
632924.2%
 
431724.0%
 
520072.6%
 
ValueCountFrequency (%) 
04092652.2%
 
11373117.5%
 
2894011.4%
 
363738.1%
 
431724.0%
 
ValueCountFrequency (%) 
632924.2%
 
520072.6%
 
431724.0%
 
363738.1%
 
2894011.4%
 

num_medications
Real number (ℝ≥0)

Distinct71
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean16.67529736
Minimum1
Maximum75
Zeros0
Zeros (%)0.0%
Memory size612.8 KiB
2020-11-27T20:28:05.522325image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile7
Q111
median16
Q321
95-th percentile32
Maximum75
Range74
Interquartile range (IQR)10

Descriptive statistics

Standard deviation8.361751681
Coefficient of variation (CV)0.5014454316
Kurtosis4.49735842
Mean16.67529736
Median Absolute Deviation (MAD)5
Skewness1.556633994
Sum1308027
Variance69.91889117
MonotocityNot monotonic
2020-11-27T20:28:05.636326image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%) 
1156497.2%
 
1655157.0%
 
1851616.6%
 
1247326.0%
 
944275.6%
 
2141155.2%
 
1039235.0%
 
1538995.0%
 
1433734.3%
 
1933484.3%
 
Other values (61)3429943.7%
 
ValueCountFrequency (%) 
11350.2%
 
22640.3%
 
34380.6%
 
47471.0%
 
58721.1%
 
ValueCountFrequency (%) 
751< 0.1%
 
7219< 0.1%
 
693< 0.1%
 
6811< 0.1%
 
675< 0.1%
 

number_outpatient
Real number (ℝ≥0)

ZEROS

Distinct27
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.5163626165
Minimum0
Maximum42
Zeros65145
Zeros (%)83.0%
Memory size612.8 KiB
2020-11-27T20:28:05.745492image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile3
Maximum42
Range42
Interquartile range (IQR)0

Descriptive statistics

Standard deviation2.035255456
Coefficient of variation (CV)3.941523632
Kurtosis108.7736565
Mean0.5163626165
Median Absolute Deviation (MAD)0
Skewness8.646025139
Sum40504
Variance4.14226477
MonotocityNot monotonic
2020-11-27T20:28:05.836256image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=27)
ValueCountFrequency (%) 
06514583.0%
 
160117.7%
 
228063.6%
 
319442.5%
 
46580.8%
 
52820.4%
 
62320.3%
 
71690.2%
 
91590.2%
 
81410.2%
 
Other values (17)8941.1%
 
ValueCountFrequency (%) 
06514583.0%
 
160117.7%
 
228063.6%
 
319442.5%
 
46580.8%
 
ValueCountFrequency (%) 
4215< 0.1%
 
4010< 0.1%
 
3327< 0.1%
 
2714< 0.1%
 
2422< 0.1%
 

number_emergency
Real number (ℝ≥0)

ZEROS

Distinct25
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.2790759934
Minimum0
Maximum76
Zeros71738
Zeros (%)91.5%
Memory size612.8 KiB
2020-11-27T20:28:06.154792image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum76
Range76
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.838763814
Coefficient of variation (CV)6.588756674
Kurtosis429.9592753
Mean0.2790759934
Median Absolute Deviation (MAD)0
Skewness16.46869809
Sum21891
Variance3.381052364
MonotocityNot monotonic
2020-11-27T20:28:06.249371image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%) 
07173891.5%
 
134074.3%
 
215922.0%
 
33350.4%
 
42160.3%
 
51830.2%
 
61800.2%
 
71190.2%
 
81000.1%
 
9980.1%
 
Other values (15)4730.6%
 
ValueCountFrequency (%) 
07173891.5%
 
134074.3%
 
215922.0%
 
33350.4%
 
42160.3%
 
ValueCountFrequency (%) 
768< 0.1%
 
37430.1%
 
2913< 0.1%
 
2526< 0.1%
 
2222< 0.1%
 

number_inpatient
Real number (ℝ≥0)

ZEROS

Distinct18
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.7466121034
Minimum0
Maximum21
Zeros50552
Zeros (%)64.4%
Memory size612.8 KiB
2020-11-27T20:28:06.349920image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile3
Maximum21
Range21
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.526452386
Coefficient of variation (CV)2.044505278
Kurtosis27.58696574
Mean0.7466121034
Median Absolute Deviation (MAD)0
Skewness4.225837857
Sum58565
Variance2.330056886
MonotocityNot monotonic
2020-11-27T20:28:06.432540image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=18)
ValueCountFrequency (%) 
05055264.4%
 
11422418.1%
 
272169.2%
 
330603.9%
 
414171.8%
 
56630.8%
 
62890.4%
 
82570.3%
 
71900.2%
 
91450.2%
 
Other values (8)4280.5%
 
ValueCountFrequency (%) 
05055264.4%
 
11422418.1%
 
272169.2%
 
330603.9%
 
414171.8%
 
ValueCountFrequency (%) 
2116< 0.1%
 
1620< 0.1%
 
1535< 0.1%
 
14410.1%
 
13540.1%
 

number_diagnoses
Real number (ℝ≥0)

Distinct14
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.596728752
Minimum3
Maximum16
Zeros0
Zeros (%)0.0%
Memory size612.8 KiB
2020-11-27T20:28:06.516098image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Quantile statistics

Minimum3
5-th percentile4
Q16
median9
Q39
95-th percentile9
Maximum16
Range13
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.839722339
Coefficient of variation (CV)0.2421729667
Kurtosis0.4580224401
Mean7.596728752
Median Absolute Deviation (MAD)1
Skewness-0.6121668629
Sum595895
Variance3.384578284
MonotocityNot monotonic
2020-11-27T20:28:06.610709image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram with fixed size bins (bins=14)
ValueCountFrequency (%) 
93873249.4%
 
8908311.6%
 
7880711.2%
 
5829410.6%
 
672719.3%
 
437394.8%
 
319952.5%
 
161170.1%
 
131050.1%
 
10840.1%
 
Other values (4)2140.3%
 
ValueCountFrequency (%) 
319952.5%
 
437394.8%
 
5829410.6%
 
672719.3%
 
7880711.2%
 
ValueCountFrequency (%) 
161170.1%
 
15630.1%
 
14770.1%
 
131050.1%
 
12430.1%
 

max_glu_serum
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
None
74892 
Norm
 
1584
>300
 
1029
>200
 
936
ValueCountFrequency (%) 
None7489295.5%
 
Norm15842.0%
 
>30010291.3%
 
>2009361.2%
 
2020-11-27T20:28:06.718264image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:06.784465image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:06.861517image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length4
Median length4
Mean length4
Min length4

A1Cresult
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
None
66807 
>8
 
5649
Norm
 
3419
>7
 
2566
ValueCountFrequency (%) 
None6680785.2%
 
>856497.2%
 
Norm34194.4%
 
>725663.3%
 
2020-11-27T20:28:06.955557image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:07.021202image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:07.105436image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length4
Median length4
Mean length3.790543211
Min length2

metformin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
62848 
Steady
13928 
Up
 
888
Down
 
777
ValueCountFrequency (%) 
No6284880.1%
 
Steady1392817.8%
 
Up8881.1%
 
Down7771.0%
 
2020-11-27T20:28:07.211398image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:07.275975image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:07.358391image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.730051886
Min length2

repaglinide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
76718 
Steady
 
1212
Up
 
322
Down
 
189
ValueCountFrequency (%) 
No7671897.8%
 
Steady12121.5%
 
Up3220.4%
 
Down1890.2%
 
2020-11-27T20:28:07.501861image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:07.624390image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:07.709593image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.066623322
Min length2

nateglinide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
77116 
Steady
 
875
Up
 
353
Down
 
97
ValueCountFrequency (%) 
No7711698.3%
 
Steady8751.1%
 
Up3530.5%
 
Down970.1%
 
2020-11-27T20:28:07.869296image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:07.972031image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:08.059090image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.047092719
Min length2

chlorpropamide
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
77974 
Steady
 
467
ValueCountFrequency (%) 
No7797499.4%
 
Steady4670.6%
 
2020-11-27T20:28:08.164314image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:08.229582image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:08.302778image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.023814077
Min length2

glimepiride
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
74136 
Steady
 
3277
Up
 
550
Down
 
478
ValueCountFrequency (%) 
No7413694.5%
 
Steady32774.2%
 
Up5500.7%
 
Down4780.6%
 
2020-11-27T20:28:08.408154image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:08.475001image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:08.560459image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.179293992
Min length2

acetohexamide
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78441 
ValueCountFrequency (%) 
No78441100.0%
 
2020-11-27T20:28:08.654854image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:08.711279image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:08.766956image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

glipizide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
66802 
Steady
10084 
Up
 
795
Down
 
760
ValueCountFrequency (%) 
No6680285.2%
 
Steady1008412.9%
 
Up7951.0%
 
Down7601.0%
 
2020-11-27T20:28:08.860212image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:08.923966image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:09.016576image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.533598501
Min length2

glyburide
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
71146 
Steady
 
5769
Up
 
843
Down
 
683
ValueCountFrequency (%) 
No7114690.7%
 
Steady57697.4%
 
Up8431.1%
 
Down6830.9%
 
2020-11-27T20:28:09.173926image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:09.292483image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:09.377189image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.311597251
Min length2

tolbutamide
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78110 
Steady
 
331
ValueCountFrequency (%) 
No7811099.6%
 
Steady3310.4%
 
2020-11-27T20:28:09.536801image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:09.657053image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:09.732505image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.016878928
Min length2

pioglitazone
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
72366 
Steady
 
5216
Down
 
453
Up
 
406
ValueCountFrequency (%) 
No7236692.3%
 
Steady52166.6%
 
Down4530.6%
 
Up4060.5%
 
2020-11-27T20:28:09.890158image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:10.005294image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:10.091805image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.277533433
Min length2

rosiglitazone
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
72498 
Steady
 
5097
Up
 
468
Down
 
378
ValueCountFrequency (%) 
No7249892.4%
 
Steady50976.5%
 
Up4680.6%
 
Down3780.5%
 
2020-11-27T20:28:10.252529image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:10.320930image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:10.411554image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.269552912
Min length2

acarbose
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
77486 
Steady
 
619
Up
 
307
Down
 
29
ValueCountFrequency (%) 
No7748698.8%
 
Steady6190.8%
 
Up3070.4%
 
Down29< 0.1%
 
2020-11-27T20:28:10.521543image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:10.592233image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:10.680637image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.032304535
Min length2

miglitol
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
77906 
Steady
 
454
Down
 
81
ValueCountFrequency (%) 
No7790699.3%
 
Steady4540.6%
 
Down810.1%
 
2020-11-27T20:28:10.791807image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:10.863619image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:10.945498image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.025216405
Min length2

troglitazone
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78441 
ValueCountFrequency (%) 
No78441100.0%
 
2020-11-27T20:28:11.044965image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:11.103031image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:11.162656image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

tolazamide
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78077 
Steady
 
288
Up
 
76
ValueCountFrequency (%) 
No7807799.5%
 
Steady2880.4%
 
Up760.1%
 
2020-11-27T20:28:11.542922image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:11.616492image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:11.698209image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.014686197
Min length2

examide
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78441 
ValueCountFrequency (%) 
No78441100.0%
 
2020-11-27T20:28:11.797844image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:11.856070image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:11.915381image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

citoglipton
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78441 
ValueCountFrequency (%) 
No78441100.0%
 
2020-11-27T20:28:12.011694image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:12.070094image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:12.130249image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

insulin
Categorical

Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
42310 
Steady
17476 
Up
9331 
Down
9324 
ValueCountFrequency (%) 
No4231053.9%
 
Steady1747622.3%
 
Up933111.9%
 
Down932411.9%
 
2020-11-27T20:28:12.225753image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:12.291852image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:12.376718image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length3.128899428
Min length2
Distinct4
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
77325 
Steady
 
990
Up
 
66
Down
 
60
ValueCountFrequency (%) 
No7732598.6%
 
Steady9901.3%
 
Up660.1%
 
Down600.1%
 
2020-11-27T20:28:12.481827image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:12.552727image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:12.641329image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.052013615
Min length2
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78338 
Steady
 
103
ValueCountFrequency (%) 
No7833899.9%
 
Steady1030.1%
 
2020-11-27T20:28:12.749218image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:12.812983image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:12.884956image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.005252355
Min length2
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78328 
Steady
 
113
ValueCountFrequency (%) 
No7832899.9%
 
Steady1130.1%
 
2020-11-27T20:28:12.984364image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:13.047323image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:13.119937image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length6
Median length2
Mean length2.005762293
Min length2

metformin-rosiglitazone
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78441 
ValueCountFrequency (%) 
No78441100.0%
 
2020-11-27T20:28:13.218498image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:13.273764image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:13.331019image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

metformin-pioglitazone
Categorical

CONSTANT
REJECTED

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
78441 
ValueCountFrequency (%) 
No78441100.0%
 
2020-11-27T20:28:13.416848image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:13.471966image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:13.529790image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2

change
Categorical

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
No
42606 
Ch
35835 
ValueCountFrequency (%) 
No4260654.3%
 
Ch3583545.7%
 
2020-11-27T20:28:13.620466image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:13.678838image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:13.742231image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length2
Median length2
Mean length2
Min length2
Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
Yes
66083 
No
12358 
ValueCountFrequency (%) 
Yes6608384.2%
 
No1235815.8%
 
2020-11-27T20:28:13.801749image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

readmitted
Categorical

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
NO
41679 
>30
27895 
<30
8867 
ValueCountFrequency (%) 
NO4167953.1%
 
>302789535.6%
 
<30886711.3%
 
2020-11-27T20:28:13.866077image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:13.926478image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:14.000135image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length3
Median length2
Mean length2.468657972
Min length2

_diag_1
Categorical

Distinct22
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
Diseases of the circulatory system
22841 
Diseases of the respiratory system
10161 
Injury and poisoning
9624 
Diseases of the digestive system
6903 
Diabetes mellitus
4934 
Other values (17)
23978 
ValueCountFrequency (%) 
Diseases of the circulatory system2284129.1%
 
Diseases of the respiratory system1016113.0%
 
Injury and poisoning962412.3%
 
Diseases of the digestive system69038.8%
 
Diabetes mellitus49346.3%
 
Diseases of the genitourinary system48096.1%
 
Diseases of the musculoskeletal system and connective tissue47466.1%
 
Neoplasms24013.1%
 
Infectious and parasitic diseases20352.6%
 
Other symptoms, signs, and ill-defined conditions19212.4%
 
Other values (12)806610.3%
 
2020-11-27T20:28:14.099140image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:14.201499image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length87
Median length34
Mean length33.45701865
Min length9

_diag_2
Categorical

Distinct22
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
Diseases of the circulatory system
22610 
Diabetes mellitus
14263 
Diseases of the respiratory system
7539 
Diseases of the genitourinary system
7440 
Endocrine, nutritional, and metabolic diseases and immunity disorders, without diabetes
6729 
Other values (17)
19860 
ValueCountFrequency (%) 
Diseases of the circulatory system2261028.8%
 
Diabetes mellitus1426318.2%
 
Diseases of the respiratory system75399.6%
 
Diseases of the genitourinary system74409.5%
 
Endocrine, nutritional, and metabolic diseases and immunity disorders, without diabetes67298.6%
 
Diseases of the digestive system34784.4%
 
Diseases of the blood and blood-forming organs20292.6%
 
Diseases of the skin and subcutaneous tissue19682.5%
 
Other symptoms, signs, and ill-defined conditions18582.4%
 
Neoplasms17642.2%
 
Other values (12)876311.2%
 
2020-11-27T20:28:14.314500image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:14.425984image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length87
Median length34
Mean length35.45231448
Min length9

_diag_3
Categorical

Distinct22
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size612.8 KiB
Diabetes mellitus
20730 
Diseases of the circulatory system
19765 
Endocrine, nutritional, and metabolic diseases and immunity disorders, without diabetes
7014 
Diseases of the genitourinary system
5979 
Diseases of the respiratory system
5455 
Other values (17)
19498 
ValueCountFrequency (%) 
Diabetes mellitus2073026.4%
 
Diseases of the circulatory system1976525.2%
 
Endocrine, nutritional, and metabolic diseases and immunity disorders, without diabetes70148.9%
 
Diseases of the genitourinary system59797.6%
 
Diseases of the respiratory system54557.0%
 
Supplemental classification27023.4%
 
Mental disorders20702.6%
 
Diseases of the digestive system20122.6%
 
Neoplasms17732.3%
 
Diseases of the skin and subcutaneous tissue15662.0%
 
Other values (12)937512.0%
 
2020-11-27T20:28:14.541539image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Frequencies of value counts

Unique

Unique0 ?
Unique (%)0.0%
2020-11-27T20:28:14.648554image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
Histogram of lengths of the category

Length

Max length87
Median length34
Mean length33.77261891
Min length9

Interactions

2020-11-27T20:27:51.446066image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:51.571520image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:51.687874image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:51.926349image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.041634image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.159932image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.273813image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.390497image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.499963image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.638685image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.753544image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.859945image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:52.967281image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.074867image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.184881image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.287432image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.397368image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.499958image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.607186image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.719616image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.826071image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:53.932481image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.038411image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.150009image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.250980image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.360799image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.462466image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.571510image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.685528image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.791648image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:54.898339image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.004344image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.113051image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.213684image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.461028image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.564195image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.671555image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.788147image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:55.898204image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.008111image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.118451image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.229940image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.334229image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.446042image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.552813image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.664477image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.771091image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.871338image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:56.971727image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.071018image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.172854image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.267430image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.374080image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.472863image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.573585image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.691711image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.804512image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:57.915365image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.025307image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.137416image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.243124image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.357223image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.463379image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.573583image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.680153image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.780354image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.881735image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:58.981098image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.083090image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.178285image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.280474image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.375223image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.477164image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.770857image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.878513image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:27:59.989061image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:00.098477image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:00.208027image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:00.308958image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:00.418750image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:00.520781image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Correlations

2020-11-27T20:28:14.744983image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
2020-11-27T20:28:14.890096image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
2020-11-27T20:28:15.033744image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
2020-11-27T20:28:15.222516image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

2020-11-27T20:28:01.019934image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/
2020-11-27T20:28:02.652894image/svg+xmlMatplotlib v3.3.3, https://matplotlib.org/

Sample

First rows

df_indexracegenderagetime_in_hospitalnum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientnumber_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideexamidecitogliptoninsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted_diag_1_diag_2_diag_3
058544CaucasianMale[50-60)190210009NoneNoneNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoYesNODiseases of the circulatory systemDiseases of the circulatory systemDiseases of the circulatory system
147528CaucasianMale[80-90)356070006NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoSteadyNoNoNoNoNoChYes>30Injury and poisoningDiseases of the respiratory systemDiseases of the circulatory system
239223CaucasianFemale[50-60)54601393709NoneNoneSteadyNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoSteadyNoNoNoNoChNoNODiseases of the circulatory systemDiseases of the nervous systemSymptoms concerning nutrition, metabolism, and development
32543CaucasianFemale[70-80)5550150019NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoUpNoNoNoNoNoNoYes>30Diseases of the respiratory systemDiseases of the skin and subcutaneous tissueDiseases of the digestive system
46310CaucasianFemale[60-70)6405150009NoneNoneSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYesNODiseases of the musculoskeletal system and connective tissueDiseases of the circulatory systemDiseases of the circulatory system
531319CaucasianMale[70-80)190100005NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoChYesNONeoplasmsDiseases of the genitourinary systemDiabetes mellitus
612896CaucasianFemale[60-70)2422152009NoneNoneNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYesNOInjury and poisoningDiseases of the genitourinary systemDiseases of the circulatory system
723299CaucasianFemale[80-90)3880180009None>8NoNoNoNoNoNoSteadyNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoYes<30Diseases of the circulatory systemDiseases of the circulatory systemExternal causes of injury
83788CaucasianFemale[70-80)2360200008NoneNoneNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYesNODiseases of the circulatory systemDiabetes mellitusDiseases of the circulatory system
929332CaucasianFemale[60-70)1231100029None>8NoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoChYes<30Injury and poisoningComplications of pregnancy, childbirth, and the puerperiumDiseases of the respiratory system

Last rows

df_indexracegenderagetime_in_hospitalnum_lab_proceduresnum_proceduresnum_medicationsnumber_outpatientnumber_emergencynumber_inpatientnumber_diagnosesmax_glu_serumA1Cresultmetforminrepaglinidenateglinidechlorpropamideglimepirideacetohexamideglipizideglyburidetolbutamidepioglitazonerosiglitazoneacarbosemiglitoltroglitazonetolazamideexamidecitogliptoninsulinglyburide-metforminglipizide-metforminglimepiride-pioglitazonemetformin-rosiglitazonemetformin-pioglitazonechangediabetesMedreadmitted_diag_1_diag_2_diag_3
7843152000CaucasianMale[40-50)845090007NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoChYesNODiseases of the respiratory systemDiseases of the respiratory systemDiabetes mellitus
7843266640CaucasianMale[50-60)1680240009NoneNormNoSteadyNoNoUpNoNoNoNoNoSteadyNoNoNoNoNoNoUpNoNoNoNoNoChYesNODiseases of the circulatory systemDiseases of the genitourinary systemEndocrine, nutritional, and metabolic diseases and immunity disorders, without diabetes
7843355751CaucasianFemale[80-90)5431180019NoneNoneSteadyNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoSteadyNoNoNoNoNoNoYes>30Diseases of the circulatory systemEndocrine, nutritional, and metabolic diseases and immunity disorders, without diabetesOther symptoms, signs, and ill-defined conditions
7843419960CaucasianMale[40-50)2170120004NoneNoneNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoChNoNODiseases of the respiratory systemDiseases of the digestive systemDiseases of the skin and subcutaneous tissue
7843526215CaucasianFemale[50-60)1368190017NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoChYes>30Diseases of the circulatory systemOther symptoms, signs, and ill-defined conditionsDiseases of the circulatory system
7843674344CaucasianMale[30-40)2692130009None>8SteadyNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoYesNONeoplasmsDiabetes mellitusDiseases of the skin and subcutaneous tissue
7843742191CaucasianMale[60-70)74032100109NoneNoneSteadyNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoSteadyNoNoNoNoNoChYes<30Diseases of the circulatory systemEndocrine, nutritional, and metabolic diseases and immunity disorders, without diabetesDiseases of the circulatory system
7843845627OtherFemale[50-60)2420160007NoneNoneNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoChYesNODiseases of the circulatory systemSymptoms concerning nutrition, metabolism, and developmentDiseases of the genitourinary system
7843925941CaucasianMale[60-70)4300170009NoneNoneNoNoNoNoNoNoNoSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoChYes>30Diseases of the circulatory systemDiabetes mellitusExternal causes of injury
7844071113CaucasianMale[70-80)2740210019NoneNoneSteadyNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoNoChYes>30Diseases of the genitourinary systemDiabetes mellitusEndocrine, nutritional, and metabolic diseases and immunity disorders, without diabetes